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Ultiaatsly,  th«  prlaary  goal  of  nost  teaching  is  to  ImproTa 
pupil  perforaanca  in  a  Tarlety  of  situetions  by  developing  his  skills, 
kaowledgss,  and  understandings*  In  order  to  deteroine  wiiat  iaproTeseat 
has  been  aade,  it  is  necessary  for  the  teacher  to  observe  the  pupil  in 
each  situation  and  oaks  an  interpretation  of  his  behavior*  Teachexv 
have  tools*- tests-^Mfaich  are  not  really  alternatives  to  teacher  obser* 
vation  of  student  behavior,  but  rather  a  specialized  technique  for 
extending,  refining,  interpreting,  and  recording  those  observations* 

Textbooks  in  msasnreaent  and  evaluation  usually  treat  this  sub- 
ject only  in  general  terras*  Th^   characteristics  which  distinguish  each 
subject  fros  another  asy  differ  widely,  therefore  little  inforoation  is 
available  which  applies  directly  to  high  school  sodern  algebra,  ^us, 
since  aany  schools  are  using  the  modern  approach,  it  would  be  worthiAile 
to  investigate  how  a  teacher  would  develop  classrooa  tests  for  hi|^ 
school  Bodern  algebra* 

^rpose  of  the  Study 

It  %#as  the  purpose  of  this  study  to  (1)  present  the  general 
principles  of  test  construction  as  they  are  coisaonly  found  in  texts  oa 
awasureiBent  and  evaluation,  and  (2)  offer  aethods  of  applying  these 
principles  to  high  school  aodern  algebra* 


aebert  L*  E^el,  HeasurinR  Educti  tional  Achievement  (Englewood 
Cliffs,  Hew  Jersey:  Prentice-Hsll,  Inc*,  1965),  p*  4. 


M«thod  of  Procedttr» 

Thm   method  of  carrying;  out  this  study  of  t«/»t  construction  for 
nodern  algebra  was  reading  and  library  research.  Only  literature  per* 
tainin^  to  clansrooa  testa  in  matheiaatlcs  and  references  on  general 
weasureaent  techniques  which  were  available  in  the  Kansas  State  tTnlvor- 
sity  Idbrary  were  Inyestigated*  "Bie  general  inforaation  was  then 
applied  to  high  school  niodem  alj^ebra  by  the  author* 

Definitions  of  TenM 

Measureiaent.  Throughout  this  report  the  following  definition 

of  eeasuresent,  by  Robert  L«  Sbel,  is  used: 

Measurement  is  a  process  of  assigning  numbers  to  the  individual 
aembers  of  a  set  of  oblects  or  persons  for  the  purpose  of  indicat«> 
isg  differences  among  then  in  the  degree  to  %ihick  they  possess  ths 
characteristic  being  neasured.  If  any  characteristic  of  persons  or 
thinge  can  be  defined  clearly  enough  so  observed  differences  between 
thes  with  respect  to  this  characteristic  can  be  consistently  veri- 
fied, the  characteristic  is  ffieasurable*^ 

Test  and  exaginatlon.  :Rie  terns  test  and  sxaadnatlon  are  ussd 
interchangeably  to  designate  any  device  or  procedure  for  ssasurlng  a 
student's  knowledge  of  the  subject* 

Claesrooa  test*  The  tem  olaasroon  tsst,  for  the  purpose  of 
this  paper,  is  defined  as  any  test  constructed  by  a  teacher  for  uss 
in  his  classrcom. 

ftujg*  The  term  quia  is  used  in  this  report  to  refer  to  any 
claserooa  teat  of  short  duration,  usually  ten  to  twenty  minutes* 


•'^Ibld. .  pp.  454-455, 


aSRERAL  PRINCIPLES  OF  TBSf  COHSTOTCTIOll 

The  general  prlnciplee  of  test  eonstraotion  apply  to  eaaentlally 
all  subjects  being  taught  in  high  school.   The  specific  application  of 
these  principles  to  high  school  modern  algebra  is  not  a  sattsr  of  devel- 
oping new  principles,  but  rather  one  of  using  the  new  approaches  of  ths 
modern  algebra  with  these  general  rules* 

Fucctions  of  Tests 

There  is  no  one  beet  test;  it  is  for  the  test-«aker  to  construct 
the  best  test  for  a  given  purpose*  Testing  is  used  throughout  ths 
school  year  to  aeeoapUsh  different  goals*   Pupils  are  evaluated  at 
the  beginning  of  the  year  in  order  for  the  teacher  to  establish  a  com- 
•on  background  for  the  entire  cl&sa,  and  at  the  end  of  chapters,  units, 
grading  periods,  or  semesters  to  determine  the  ultimate  aohievement  of 
each  student. 

Teaching  is  aimed  at  improving  pupil  performance  in  a  variety 
of  situations  by  developing  knowledge,  skill,  and  understanding,  ^leh 
can  be  successfully  applied  in  life.   Ideally  all  testing  should  be 
directed  toward  this  type  of  pupil  performance,  since  the  pupil  vho 
applies  knowledge  has  aaetered  it  well.  A  pupil  who  has  simply  memo- 
rized the  principles  may  be  able  to  recall  facts  without  the  ability 
to  apply  them.   Teaching  n^lch  is  alaed  at  knowledge,  skill,  and  under- 
standing can  be  evaluated  by  three  types  of  tssts:  an  instructional 


John  A.  Qrsen,  Teacher-Made  Tests  (New  Tork:  Harper  tr  Sow. 
1963),  p.  7. 
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test,  tdilch  cheeks  skill;  a  nsstery  test,  which  eTslaates  the  knowledgs 
of  f undaaentals ;  and  a  nsasoraaent  tost,  which  aeaeuraa  the  pupil's 
understanding  of  concepts* 

The  instructional  test«  Testing  the  student's  knowledge  or 
skill  vd.th  an  Instructional  test  is  aiasd  aore  at  shoving  hia  some  of 
his  strengths  and  weaknesses  than  at  determining  his  grade.  An  instruc- 
tional test  saphaair.es  and  reinforces  the  iaportant  points  of  a  unit 
tdiich  a  pupil  atust  learn  and  revember.  To  be  aeaningful,  with  the  aax- 
Xmm  Isaming  to  result,  the  test  is  likely  to  b«  in  ths  form  of  a  tsn- 
to  twenty-fldnute  quis  %dilch  is  gradsd  by  the  pupils  snd  diseasssd 
immediately  afterwards. 

An  instructional  test  is  usually  neither  a  speed  test,  having  a 
ceiling  in  terns  of  number  of  iteeis  cos  pie  ted,  nor  a  power  test,  with 
the  ceiling  in  teras  of  the  difficulty  of  itests,  since  the  student  is 
supposed  to  know  everything  on  the  test  and  is  given  sufficient  time 
that  he  should  be  able  to  finish.  TtiLa   test  oay  be  used  as  a  teaching 
device  when  the  pupils  have  not  sufficiently  learned  ths  giaterial  froa 
elaseroos  instruction  or  froB  the  hoaewozic.  It  can  be  ussd  to  supple- 
sent  instruction  by  presenting  a  test  on  the  aost  iaportent  areas  which 
Mill  show  each  student  where  he  needs  to  concentrate  farther  study* 

The  mastery  teat.  If  instruction  is  not  to  proceed  so  slowly 
that  pupil  interest  is  lost,  or  advance  so  rapidly  that  many  pupils 
fail  to  aaster  even  the  miniBum  essential  skills,  the  teacher  aust 


Ibid. «  p.  2. 
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PBC9   the  Instruction  to  th«  learnln,?  rate  of  the  najoritj.  The  nastory 
teat  is  the  teacher's  aethod  of  deteralnlni;  this  learning  rate.   In 
order  to  proceed  to  aore  difficult  tasks  or  to  work  based  oo  previous 
knowledge.  It  is  necessary  for  the  teacher  to  determine  when  the  papils 
have  acquired  sufficient  competence  or  skill  to  enable  them  to  continue* 
Since  these  skills  will  be  the  foundation  for  subsequent  vork,  the 
teacher  must  expect  the  students  to  answer  90  per  cent  correctly. 

A  asstery  test  could  be  a  speed  test,  since  the  pupils  tals^t   be 
•xpected  to  wjrk  rapidly  as  well  as  accurately  in  certain  eases.  The 
students  may  have  learned  to  expand  binomiala  by  inspection,  for 
instance,  and  thus  the  teacher  might  want  to  llKlt  the  time  in  order  to 
prevent  students  froa  multiplying  the  binomials  out.  "niere  would  be  a 
■inimua  number  of  problems  that  all  pupils  would  be  expected  to  com- 
plete* with  SKple  time  allotted  for  this  part,  and  additional  problems 
would  be  stfded.   In  this  case,  these  would  be  more  problems  than  any 
one  pupil  could  finish  in  the  given  tirao,  in  order  to  establish  each 
pupil *s  mastery  beyond  the  ralniaua  raqulreraents. 

A  aastery  test  could  be  a  power  teot,  on  the  other  hand,  where 
the  teacher  would  be  checking  the  ultlKete  echievefflent  of  the  pupils 
beyond  the  required  skills.   As  in  the  previous  example,  the  teacher 
would  have  a  ainlaun  nunber  of  exponsiona  which  all  the  pupils  were 
expected  to  complete  and  then  froa  that  point  progressively  harder  or 
more  coaplicated  probleas  would  be  included.  Ro  one  would  be  expected 
to  coaplete  the  ffloat  difficult  problem. 


^Ibid..  p.  2-3. 
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ISxe  meaaurement  test.  Th*   sxaalnatioa  with  which  pupila  are 
sost  faoillar  is  the  aeasureaeot  test*  TUia  is  the  teet  which  purports 
to  give  exact  seaaoreaent  of  each  pupil*s  achieveaent  throughout  the 
school  year,  daring  the  aeaeetdr,  or  at  the  end  of  a  unit.  This  teat 
ia  the  primary  grade  detersiaer  for  each  pupil.  If  it  is  of  the  infor** 
■al  objective  type,  it  hsa  a  50  per  cent  level  of  difficulty—i.e.,  the 
average  pupil  will  answer  50  por   cent  correctly-- and  has  a  power  arrange- 
ment with  the  first  few  iteaa  easy  enough  for  the  slowest  pupil  and  the 
last  it«i8  too  difficult  for  the  brightest.  This  arrangeaent  peroita 
the  teacher  to  establish  a  point  of  oaxiaum  achievement  for  each  pupil. 

Ty;^>ea  of  Tsate  and  Iheir  Pees 

Once  the  fxmction  of  a  test  has  been  determined  the  teacher  ia 
presented  with  the  task  of  choosing  the  right  type  of  teat  to  accofflplish 
that  function.  One  situation  might  require  the  pupile  to  prove  a  par> 
ticular  atatement.  Another  would  call  for  the  recognition  of  terns, 
defiziitioas,  or  problom-solving.  The  teacher  may  %Ant  to  check  quickly 
to  see  if  the  class  has  understood  a  point  Juat  covered,  or  there  «ay 
have  b««B  aoae  practical  application  just  aede  apparent  which  needs  a 
practical  testing  aituation.   Each  of  these  situations  can  be  classified 
into  an  oral,  objective,  essay,  or  perforaanoe  type  test.   Table  I 
gives  a  composite  picture  of  the  relative  strengths  and  weaknesses  of 
each  of  these  types. 


Ibid.,  p.  3, 
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Ibid.,  p.  kt 


TABLE  X 


STRENGTHS  AHD  WEAKRESSES  OF  VARIOUS  TEST  POfiHS 


Btrengthm 


WmOc&*8s*« 


1. 

2. 
3. 

5. 
6. 

7. 


1. 


2. 

3. 

4. 


1. 


2. 


^. 


ORAL 


1*  Olvefi  sxttnaiTe  aeasureaeut. 
2*  Is  useful  aa  an  inatruc* 

tional  device. 
3,  Peneits  teacher  to  give  cues 

to  elicit  desired  responses* 
4*  Isproves  test  rapport  for 

pafiilfl  y/ho  fear  trritten 

•aualaatloas. 


1.  Is  too  tiiBe<>consuKing* 

2.  Besults  in  poor  pupil  perform- 
aace  due  to  lack  of  practice. 

3*  Provides  liaited  saople  unless 
pupils  are  tested  individuall^r. 
k,  la   frequently  poorly  planned. 

5.  le  not  subject  to  refineaent* 

6.  Gives  poor  eoaparative  evalaa* 
tion  of  pupils* 


Qlves  an  extensive  test 

sasple. 

Can  be  aade  highly  reliable. 
Can  be  graded  objectively  and 
quickly. 

Eliffiinates  bluffing. 
Can  be  subjected  to  itea  analy- 
sis and  further  refineaent. 
Can  be  adapted  to  several 
teaching  objectives. 
Can  be  aede  highly  valid  for 
soae  teaching  objectives. 


OBJECTIVE 
1. 


2. 

3. 

k. 

5. 
6. 


ESSAY 


Is  applicable  to  aeasurcaent 
of  tnriting,  organizational 
ability,  and  creativeness. 
Is  easy  to  construct. 
Proaotee  proper  type  of  study. 
Is  adaptable  to  several  eub- 
Ject  fields. 


Stresses  application  of  knowl- 
edge. 

Can  be  used  as  a  learning 
device. 

May  give  a  tiruer  achievement 
picture  for  the  verbally 
handicapped  pupil. 
t4easures  soae  skills  and  abstract 
abilities  not  measured  by  other 
conventional  test  foznas. 


1. 

2. 

3. 
4. 


Frequently  neglects  aeasureawnt 
of  higher  thought  processes* 
Overemphasizes  rote  learning* 
Promot0B   poor  study  habits. 
Bneourages  guessing. 
Is  difficult  to  prepare. 
Costs  aore  than  essay  to  pre- 
pare and  reproduce. 


1.  Gives  a  liaited  test  saaple. 

2.  Is  diffioult  to  grade. 

3»  Favors  the  verbally  inclined 

student. 
4.  Has  low  reliability. 

3.  Encourages  bluffing. 


PERFORMAKCE 


Is  not  adaptable  to  many  fields 
of  learning. 

Is  difficult  to  construct. 
Is  often  difficult  to  grade* 
Is  often  time-consuaing* 


Ibid.,  p.  5, 
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Th«  oral  exaalnatlon.  Oral  quastioning  can  giva  an  axtenalTa 
picture  of  a  pupil*e  knovladga,  but  is  order  to  do  «o  sufficient  time 
■uat  be  devoted  to  the  examination,  and  the  questions  auet  be  carefully 
preiMtred  ahead  of  time  in  order  to  give  an  adequate  sample  of  the  areas 
covered.  Each  exam  would  take  from  one-half  to  one  hour.  Another  form 
of  oral  questioning  is  the  familiar  teaching  device,  i^ich  is  a  meana 
of  stimulating  pupil  participation  in  class  discussion,  where  grades 
are  not  counted  heavily.  Oral  questioning  peraita  the  examiner  to  be 
flexible  in  his  procedure.  He  may  adapt  his  questions  to  fit  each 
pupil,  and  he  can  draw  out  additional  meanings  behind  vague  or  incom- 
plete statetaents.   I^is  is  an  excellent  means  of  following  the  thought 
processes  which  a  pupil  has  used  in  solving  a  problem,  and  a  good  way 
of  diagnosing  pupil  difficulties. 

Many  areas  of  teaching  stress  skills,  linowledges,  and  understend- 
ings,  which  lay  the  foundatioc  for  further  learning  but  for  which  there 
is  little  iauaediate  application.   Claasroom  testa  of  the  essay  or  obJec« 
tive  type  will  give  the  best  estimate  of  learning  in  these  areas. 

The  objective  test.  The  objective  test  is  strong  where  the 
essay  test  is  %Mak.   It  permits  reliable  measurement  of  an  extensive 
amount  of  factual  material,  and  is  the  aost  objective  type  in  scoring, 
hence  the  name.  It  is  difficult  to  construct  objective  tests  iritich 
measure  such  types  of  learning  as  complex  understandings,  creativity, 
attitude  changes,  and  problem-solving.   The  objective  test  has  four 


^Ibid. ,  p.  k, 

2 
Ibid. ,  p.  6« 
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eonaon  foras  trfiich  ttr*  •xt«nsiv«ly  usttd  itt  clsiaarooffl  t*at3}  th«  short- 
answer  fora,  the  al tsroate-rasponss  form,  the  multiple-choice  form, 
and  the  matching  fora. 

The  short-answer  form  of  objective  test  aay  include  questiooa* 
incomplete  sentences,  definitions,  or  identification  iteas.  This  fora 
is  characterized  by  having  its  answers  in  the  fora  of  a  word,  a  number, 
or  as  long  as  a  sentence  in  the  case  of  a  definition.   This  fora  is 
good  for  the  maaaureaent  of  recall,  and  it  is  better  than  the  altemate- 
respoose  and  uiultiple-choice  forms  in  that  the  .student  cannot  guess  the 
correct  response  unless  the  item  is  poorly  constructed.  Short-answer 
tests  tend  to  stress  rote  aemory,  and  the  teacher  who  uses  verbatia 
quoting  will  farther  emphasize  this  weakness.  This  can  be  avoided  by 
rephrasing  the  statement,  which  will  se^ve  to  check  understanding. 

The  alternate-res^.ouse  fora  generally  ooaslsta  of  a  stateiaent 
to  be  judged  true  or  false.  Since  there  are  only  two  choices,  the  stu- 
dent has  a  ^0  per  cent  chance  of  guessing  the  correct  reaponae;  and 
since  he  will  aost  likely  reaeober  aontething  about  each  statooent,  his 
ohanoes  for  guessing  correctly  may  be  increwsed.  Hence  it  is  very 
important  that  iteas  be  without  unnecessary  clues.   This  fora  can  meas- 
ure both  factual  information  and  understanding  of  concepts,  but  in 
order  to  be  valid  and  reliable,  it  aust  consist  of  no  fewer  than  about 
forty  items  and  be  carefully  constructed.   The  test-wise  pupil  may  fur- 
ther Increase  his  chances  of  guessing  correctly  since  teachers  have  a 
tendency  to  select  most  of  the  iteas  directly  from  the  book  and  to 
include  more  trua  than  false  itoos.   The  true-false  type  of  qaestloa 


hbid.,  p.  31. 
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tands  to  penalize  the  brighter  student  since  that  student  aoat  fre« 
quently  thinks  of  the  exception  or  conditional  factors  that  change  the 
■•suing  of  the  qoestion.   It  is  therefore  very  laportant  that  the 
test-maker  make  each  statement  entirely  true  or  entirely  false.   The 
test-ssker  can  avoid  these  asblguous  stateaente  by  aodifying  the  test 
sttsh  that  Mtittn   the  student  aarks  a  statenent  false,  he  aust  tell  what 
would  aake  the  statement  true.   The  true-false  item  is  adaptable  and 
relatively  sssy  to  prepare,  but  it  is  less  disc rioina ting,  itea  for 
Ites,  than  the  multiple-choice  type,  end  somewhat  sore  subject  to  ambi- 
guity and  misinterpretation. 

Ths  ■altipls-eholoe  test  Is  a  good  test  for  seaeuring  a  variety 
of  educational  objectives.  According  to  Robert  L.  £bel,  "Almost  any 
understanding  or  ability  that  can  be  tested  by  means  of  any  other  it«i 
form— short  answer,  completion,  true-false,  matching  or  essay— can  also 
be  tested  by  means  of  multiple-choice  test  items."   Multiple-choice 
items  have  an  incomplete  statement  or  a  question  followed  by  several 
choices  iriiieh  include  ons  correct  answer  and  several  distracters,  all 
of  which  are  plausible. 

The  matching  examination  is  most  useful  for  ■smsaring  recogni- 
tion and  recall.  In  this  test,  a  number  of  homogeneous  questions  or 
statements  are  given  and  a  list  of  between  five  and  fifteen  homogeneous 


Denis  Baron  and  Harold  W.  Bernard,  Evaluation  Techniques  for 
Classroom  Teschers  (New  York:  McGraw-Hill  Book  Company,  1958),  p.  20?. 

2 

Ebel.  o£,  cit. ,  pp.  60-61. 

^Ibid..  p.  149. 


u 

r««pons«e  arranged  for  aaxlBua  clarity  and  coavaniance  to  tha  axamlaaa 
is  provided  to  match  with  tha  pramisaa.  Th9   nuabar  of  raaponaae  is 
limitad,  sinca  for  a  largar  list,  much  tiaa  would  ba  apant  saarehiaff 
for  tha  corract  rasponaa,  aapacially  if  tha  list  wara  to  ba  dividatf 
batwaen  two  pages.   A  ona-to-one  Hatching  batwaan  preaiaaa  and  rasponsas 
should  not  ba  providad,  as  a  studant  nay  ba  abla  to  aliainata  what  ha 
knows  and  hava  a  good  chance  of  guessing  tha  rest  correctly. 

Tha  aasay  test.  Tha  assay  test  is  tha  nost  affactiva  aaans  of 
evaluating  creativanaas  on  the  part  of  a  studant.  It  fflaasuras  such 
non-struoturad  types  of  learning  as  problaa-solving,  creative  thinking, 
writing,  and  organizational  ability.   If  the  pupil  is  to  write  a  proof 
of  a  statement,  ha  auat  be  able  to  supply  the  steps  from  his  own  back- 
ground  of  knowledge,  organize  his  stapa  until  tfaay  are  logical,  and 
supply  reasons  for  his  solution.  Thus  the  nathematical  proof  is  neces- 
sarily classified  as  an  assay  test. 

Tha  essay  test  ia  weak  in  that  it  is  inadaquata  for  sanpllng  a 
great  breadth  of  knowledge.   Due  to  the  aaount  of  time  required  for  tha 
studant  to  organize  and  present  his  answer,  only  a  vary  few  questions 
can  ba  asked.   Another  disadvantage  is  the  subjectivity  of  grading.  In 
order  to  avoid  subjectivity,  it  is  necessary  that  all  poasible  points 
ba  liated  on  the  axaaination  kay,  and  these  aust  be  adhered  to  very 
closely.  This  practice  also  reduces  the  tendency  on  the  part  of  tha 
teacher  to  allow  points  where  a   student  has  tried  to  bluff  and  atissad. 


Oraan,  o£.  eit«,  p.  6. 


Th«  performance  taat.  Knowledge  and  the  successful  application 
of  knowledge  do  not  always  correlate  highly.  Those  vAo  apply  knowledge 
in  successful  perforaance  have  aebieved  a  higher  degree  of  learning 
than  that  represented  by  the  nere  retention  of  knowledge.  Perfoi«anc« 
testing  is  valuable  for  any  subject  in  which  pupils  are  being  taught  to 
follow  specific  procedures  or  to  create  soae  product,  with  its  strength 
being  in  its  ability  to  examine  the  application  of  classroom  learnings 
in  actual  Job  performance.  Because  the  product  and  the  procedure  may 
be  closely  related,  as  in  Industrial  Arts,  the  teacher  interested  in 
performance  measurement  may  choose  to  concentrate  on  the  procedure,  the 
product,  or  on  any  combination  of  the  two* 

Methods  of  Construction 

In  developing  a  high-quality  test,  the  teacher  is  compelled  to 
observe  many  rules  and  many  cautions,  most  of  which  apply  to  all  types 
of  tests.  1!he  job  of  preparing  a  test  need  not  be  laborious,  but  it  is 
obvious  that  soma  advance  attention  must  be  given  to  the  preparation  if 
each  test  is  to  be  of  educational  value.   Tliere  are  essentially  three 
stages  to  developing  a  high-quality  test:  the  planning  stage,  the  con- 
stiruction  stage,  and  the  evaluation  stage. 

•Rie  planning  statue.   Every  teacher  should  be  directing  his 
instruction  toward  a  number  of  specific  and  general  educational  objec- 
tives, ^ey  should  be  stated  in  precise,  operational  terms  to  form  the 
basis  for  planning  a  test,  since  the  measurement  objectives  should  be 


^Ibid. ,  p.  k3. 
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th«  same  as  ths  instructional  objectives.   In  testing,  the  instruc- 
tional objectives  determine  the  choice  of  test  fom!  and  the  type  and 
nusber  of  Iteos.   These  objectives,  along  with  the  content  areas  which 
have  beeii  taught,  form  the  outline  for  constructing  a  good  test^  Since 
all  objectives  do  not  have  the  sane  value,  weights  in  the  form  of  per- 
centages should  be  assigned  to  these  to  indicate  emphasis.   Perhaps  the 
best  way  to  insure  proper  emphasis  in  both  the  objectives  and  the  con- 
tent areas  is  to  construct  a  tdst-specifi cation  chert. 

A  test-epeoificatioB  chart  is  prepared  by  listing  the  content 
areas  and  the  objectives  for  the  unit  to  be  tested.   To  form  a  chart* 
ths  content  areas  are  put  on  the  left-hand  side  and  the  objectives 
across  the  top,  asking  a  two-way  table.   If  such  a  test-specification 
chart  is  used  at  the  beginning  of  each  chapter,  the  teacher  will  have 
in  written  fora,  bis  objectives,  content  areas,  and  an  outline  for 
quizzes  and  the  chapter  test.  Re  can  see  at  a  glance  the  relative 
value  of  each  area  and  guide  his  instruction  accordingly.   If  during  • 
chapter,  the  weights  must  be  changed,  this  should  be  done  on  the  chart 
also,  so  that  when  it  is  tiae  for  a  test  the  measureaent  objectives  and 
the  instructional  objectives  will  still  be  the  saas.   In  Table  II  it 
can  be  noted  that  content  area  W  had  only  one  objective,  D,  which  is 
the  objective  for  no  other  content  area.   This  5  per  cent  of  the  mate- 
rial aig^t  then  be  tested  separately  in  a  short  quiz  to  avoid  having 
nuaerous  different  kinds  of  questions  on  the  unit  test.   In  the  sans 
manner,  content  area  R  has  only  one  objective,  B,  and  again  a  quia 


^Ibid.,  p.  16. 
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■ight  b«  ffiore  practlcBl  for  eTaluetlon  than  sn  additional  area  on  a 
unit  teet.   Another  ieportant  obeervation  to  make  about  Table  II  is 
that  content  area  X  conetitutee  23   per  cent  of  the  totel  content,  and 
objective  B  at  ^0  per  cent,  appears  to  be  the  aost  important  objective 
in  the  unit.  Thus  Table  II  has,  in  generel,  outlined  the  types  of  teats 
and  the  relative  values  of  each  instructional  goal. 

TABLE  II 
TEST-SPECIFICATION  CHART 


CONTEST  AREAS 

OBJECTIVES 

A 

B 

c 

D 

Totals 

S 

0 

10 

0 

0 

10 

s 

10 

0 

10 

0 

20 

T 

5 

10 

5 

0 

20 

TJ 

5 

3 

0 

0 

10 

V 

0 

5 

5 

0 

10 

w 

0 

0 

0 

5 

5 

X 

5 

10 

10 

0 

25 

Totals 

25 

ko 

50 

5 

100 

The  task  of  th«   test~«aksr  is  greatly  rsdueed  if  he  can  writs 
several  test  itess  for  each  content  area,  directed  towards  the  spe- 
cific objectives,  as  he  plans  a  lesson.  This  will  serve  a  dual  pur- 
poss  of  guiding  his  instruction  towards  the  objectives,  and  eliminat- 
ing the  need  to  eoaposs  a  test  from  the  beginning  at  examination  tiae. 
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If  more  itaos  than  needed  are  written  each  tine  and  kept  on  file,  he 
can  chooee  hia  questiona  to  best  suit  the  particular  class. 

The   construction  atajj^.  When  the  test  has  bean  planned  in  thia 
■aijeier  the  construction  of  a  test  ie  in  two  phases:  the  itea<-writing 
phase  and  the  coapiling  phase.   I tea-writing  is  done  when  the  lessons 
are  being  planned  and  conpiling  is  done  wnen  the  test  is  to  be  fdlven* 

In  writing  items  jfor  tests,  »m   far  as  possible,  tetst  for  prin- 
ciples and  generalisations  rather  than  isolated  facts.  The   items 
should  be  clear  and  concise  and  worded  in  such  a  way  that  pupils  are 
not  unduly  penalised  by  reading  speed  and  coaprehension.   Inaccurate 
wording  often  prevents  the  pupil  from  understanding  the  questions  asked, 
and  he  nay  thus  be  unable  to  ansirer  even  those  questions  to  which  he 
would  otherwise  have  a  ready  answer. 

iAien  striving  for  objectivity,  it  is  particularly  important  that 
there  be  no  clues  to  the  correct  answers  other  than  the  meaning  of  the 
questions.   In  the  same  manner,  information  used  in  one  problem  should 
not  provide  a  load  to  another's  answer.  The   items  must  also  be  worded 
such  that  an  answer  which  is  supposed  to  be  correct  is  not  only  unques- 
tionably correct,  but  also  the  only  correct  aamwer.  the   teacher  mast 
be  careful  to  omit  only  significant  words  in  the  incomplete  statement 
items,  and  must  be  sure  to  leave  enough  clues  so  that  the  pupil  who 
knows  the  aziswer  can  give  the  correct  response.  Oramatical  clues  to 
the  correct  answer  onu  over-mutilateu  stateaento  should  be  avoided.   If 
too  many  clues  are  given,  most  students  will  answer  correctly,  wiiile  if 
too  few  clues  ore  given  there  may  be  a  great  variety  of  answers  due  to 
failure  to  understand  the  type  of  response  desired. 


1ft 

Zb  ■altipls-ehoice  items «  the  central  problem  of  the  itea  ahould 
be  stated  so  as  to  make  only  one  choice  Justifiable,  but  all  choices 
should  sees  plausible.  Increasing  the  homogeneity  of  the  choices  makss 
an  item  more  difficult.   The  negative  multiple-choice  item,  as  well  as 
the  negative  true-false  item,  tends  to  measure  reading  ability,  since 
poor  readers  often  respond  to  such  items  incorrectly,  even  vh9n   they 
know  the  correct  an8*fer. 

In  compiling  a  test,  there  are  several  factors  which  oust  be 
considered.   The  test  must  be  planned  to  fit  the  fifty-  to  sixty-ainute 
class  period,  but  should  be  long  enough  to  be  valid  and  reliable,  ^le 
length  necessary  for  validity  depends  upon  the  number  and  complexity  of 
the  objectives  being  measured.   Reliability,  on  the  other  hand,  is 
related  more  directly  to  the  length  of  a  test  than  to  the  objectives, 
and  increasing  the  length  of  a  test  increases  the  reliability  provided 
the  itcns  sire  of  equal  quality.   Poorly  written  items  added  solely  to 
lengthcTh  a  test  may  have  the  effect  of  actually  decreasing  the  relia- 
bility of  the  instrument. 

A  test  should  generally  consist  of  no  more  than  two  or  three 
typss  of  items,  and  all  the  items  of  one  type  should  be  included  in  one 
section.   If  numerous  instructional  objectives  are  being  measured  and 
if  they  need  the  use  of  a  number  of  different  kinds  of  test  items,  sev- 
eral different  tests  should  be  given  at  different  times.  Frequent 
tests  can  sample  smaller  units  of  instruction  and  thus  give  a  more 
reliable  basis  for  evaluation. 


^Ibid.,  p.  36. 
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ttf   first  iteoB  on  a  t«st  should  be  easy  enough  for  all  students 
to  answer.  Success  in  these  first  items  vdll  give  sobs  security  to  a 
student  vho   approaches  the  test  iri.th  a  poor  test  psychology.   The  stu- 
dent who  cooes  to  the  test  in  this  frame  of  mind  may  very  well  wLLmm 
items  which  he  would  answer  easily  were  he  not  in  a  test  situation. 

The  teacher  should  take  care  to  make  the  directions  clear  and 
explicit,  and  there  should  be  a  siaple  method  of  indicating  answers  to 
make  thMi  sasy  to  score.  Each  test  should  be  typed  and  reproduced  by 
••■•  means  so  that  each  pupil  can  have  a  copy. 

A  test  should  be  completely'  constructed  about  one  week  ahead  of 
the  date  that  it  is  to  be  administered.  It  should  be  left  for  several 
days  and  then  examined  again  for  construction  errors  in  the  form  of: 
one  problec  suggesting  the  answer  to  another,  ambiguous  que^ions  or 
atateaents,  and  sore  than  one  correct  answer  to  a  question. 

The  evaluation  stage.  All  measuring  instruments  possess  to  some 
degree  three  important  qualities.   These  are  validity,  which  refers  to 
an  instrument's  truthfulness;  reliability,  which  refers  to  its  consist- 
ency; and  usability,  which  pertains  to  its  practicality. 

Validity  is  extremely  important.   A  test  is  valid  when  it  aeta- 
ally  measures  what  it  is  designed  to  measure.   Bobert  L.  Ebel  lists 
ten  different  types  of  validity  tdtich  "are  not  all  distinctly  different 


Ibid. ,  p.  18. 
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fro*  •«ch  other. "^  Tha  type  of  validity  which  is  of  ajoet  concern  to 
the  teacher  with  each  test  ia  the  content  validity.  For  ciaesroo* 
t«sts,  this  means  checking  the  contents  to  detertaine  the  extent  of 
agraeaent  between  the  aaterial  in  the  teat  and  the  inatructioaal  objec- 
tivea  being  aeasurad*  The  best  ffleans  of  detuntining  the  content  valid- 
ity of  objective  teats  is  to  ooapare  the  test  content  to  the  table  of 
specifications.   If  the  table  has  been  correctly  used  to  construct  the 
test,  there  should  be  very  high  validity. 

Another  iaportant  step  in  the  evaluation  of  objective  tests  is 
item  analysis,  a  check  on  the  quality  of  each  itea  on  the  test.  Since 
it  requires  time,  an  itea  analysis  should  not  be  done  for  tests  which 
are  carelessly  prepared,  and  tdiich,  the  teacher  feels,  do  not  discria- 
Inate  bet  .sen  good  and  poor  pupils.  In  aduition  to  indicating  the 
contribution  of  separate  iteas  to  over-all  validity,  an  itea  analysis 
also  givos  inforaation  on  the  difficulty  of  each  itea  and  the  discria* 
ination  of  each  itoa  between  good  and  poor  pupils.  If  a  higher  per- 
centage of  good  pupils  than  poor  pupils  answer  an  itea  correctly,  that 
itea  sho«»  positive  discriaination. 

For  teacher-aade  teste  the  following  siaple  procedure  gives 
adequate  evidence  for  deteraining  the  quality  and  difficulty  of 
th«  iteas; 

1*  Select  the  top  20  to  30  percent  sad  the  bottoa  20  to  30  percent 
of  the  papers. 

2.  Working  with  one  group  at  a  time,  place  the  papers  on  a  large 
table  overlapping  €£ch  other  so  that  only  the  response  column 
of  each  paper  is  visible. 


Tbel,  0£.  cit«,  p.  381, 
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3.  Count  th«  nnab«r  of  correct  rasponace  on  •ach  it*B  and  convart 
to  percants,  which  are  entered  on  the  raeord  ahaet.  For  axanpla* 
if  18  out  of  20  pupila  in  the  high  group  gat  the  itan  correct, 
90  percent  ahould  be  entered  on  the  record  ahaet  for  the  iten. 

k.   When  the  correct  reaponaea  have  been  counted  for  both  the  high 
and  low  groupa  on  all  itevs,  the  power  of  the  item  can  be  deter- 
mined by  calculating  the  percent  in  both  groupa  who  reaponded 
correctly.  For  example,  tdien  32  of  the  kO   pupila  have  auccaeded 
on  the  item,  the  eaae  index  ia  80  percent.  Or,  stated  another 
way,  the  difficulty  level  ia  20  percent  becauae  20  percent 
failed  the  item. 

3*  Calculate  the  diaerimination  of  each  item  by  uaing  the  following 
formula : 

n   P  •  L 

where  D  ■  index  of  itea-diacriminating  power. 

U  B  nuiU>er  of  pupila  in  upper  group  anawering  item 

correctly. 
L  t   number  of  pupila  in  lower  group  anawering  item 

correctly. 
M  s  number  of  pupila  in  each  group. 

Tha  diaerimination  index  will  range  from  -t-LOO  to  -1.00,  and  only 
iteaa  which  show  poaitive  acorea  will  be  retained.  A   diaorialnatioa 
index  above  4-0.40  ie  deairable.^ 

After  the  item  analysis  is  made,  the  very  eaay  items  which  tfsra 
answered  correctly  by  all  pupils  should  be  discarded,  although  a  few 
say  be  retained  to  uae  at  the  beginning  of  future  teata  to  help  estab- 
lish a  good  teat  psychology  for  the  pupils.  Other  items  which  should 
be  discarded  are  those  which  do  not  discriminate  between  good  and  poor 
pupils  or  those  %<hich  show  negative  diaerimination,  where  the  pooreat 
pupils  are  more  succasaful  than  the  best  pupils.  On  attltipls-choies 
test  items,  there  should  be  a  record  made  of  the  number  of  pupils 
choosing  each  response.   Items  which  discriminate  properly,  but  have 
weak  distractera  can  often  be  rewritten  for  future  use. 


arson,  o£.  cit. ,  p.  88. 


Th«  eozialstency  tdith  which  a  t«8t  ii«asur«8  >diat  it  la  aaasuring 
la  called  reliability.   If  the  teat  la  reliable,  aubaequent  aeaaure- 
■enta  irLth  the  teat  gire  approxiaately  the  eaa*   acore  each  time*  Aa 
the  aoount  of  change  in  indiridual  pupila  will  Tary,  it  ia  difficult 
to  tell  tdiether  acore  differencea  are  a  reault  of  low  teat  reliability 
or  of  marked  pupil  change. 

Soaa  controllable  faetora  which  eaight  increaae  or  decrease  the 
reliability  of  a  teat  would  be  the  length  of  the  teat,  the  level  of 
difficulty  of  the  teat,  and  the  objectivity  of  the  acoring.  Within 
lifflita,  lengthening  a  teat  increaaaa  ita  reliability,  provided  tha 
Itasa  added  are  of  equal  or  better  quality  than  the  original  itama. 
There  ia  a  point,  however,  where  the  teat  becoaea  ao  long  that  fatigue, 
boredom,  or  other  factors  tend  to  reduce  the  reliability.   Short  daily 
quizsea  are  completely  unreliable  unlaaa  the  acorea  are  accumulated  to 
give  one  acore,  ao  that  the  cumulative  teat  ia  long  enough  to  give 
measurement  reliability.   In  the  aarae  manner,  a  trua-falae  teat  of  !••• 
than  about  forty  itema  ia  too  unreliable  to  have  much  meaaurement  use.^ 

The  objectivity  of  the  acoring  affecta  reliability  ainca  the 
objectivity  in  a  teat  ellminatea  the  peraonal  opinion  or  judgement  of 
the  acorer.  If  the  scorer' a  peraonal  opinion  were  to  enter  in,  the 
reliability  would  deoreaae  ainca  hia  opinion  may  differ  greatly  betwaaa 
teat  papers,  being  influenced  by  aueh  fj>>ctor8  aa  the  peraonality  of  the 
examinee,  the  neatneaa  of  the  paper,  or  the  method  uead  to  arrive  at 
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th*  answer  to  th«  qucation*   Th«  reliability  is  thus  greatest  with  the 
objective- type  test. 

Three  different  reliability  coefficients  are  often  obtained  to 
check  the  reliability  of  a  test.  These  are  the  coefficient  of  stabil- 
ity, the  coefficient  of  equivaXenoy,  and  the  coefficient  of  internal 
consistency.   For  each  of  these  coefficients,  the  Pearson  aethod  is 

used  to  determine  the  correlation  coefficient. 

2 


Correlation  coefficient,   Pearson  method: 

r  a 


where:  r  «  sample  correlation  coefficient. 

X.  B  deTietion  from  the  mean  of  the  first  variable. 
Xp  B  deviation  from  the  mean  of  the  second  variable. 

Beliability  may  be  checked  using  the  test-retest  method,  tdiere 
the  same  test  is  administered  a  second  time  after  a  lapse  of  time  of 
from  one  to  six  months.  1%is  method  gives  a  coefficient  of  stability 
since  it  is  based  on  the  stability  of  the  performance  of  a  group  in  a 
retest  situation,  fhe  scores  from  the  first  and  second  administratioiui 
are  then  correlated  to  determine  reliability.  Using  the  Pearson  form- 
«ilm«  X.  is  the  deviation  of  each  student's  score  from  the  msan  of  the 
first  test,  and  x^  is  the  deviation  from  the  mean  of  the  second  test. 
The  coefficient  should  be  at  least  O.85  for  good  test  reliability. 


^Ibid. .  p.  93. 
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A  coefficient  of  •quivalenc*  is  d«t«min*d  by  Adainietering  two 
equivalent  foras  of  the  same  test  to  a  group  at  the  same  time,  with 
oneohalf  of  the  group  taking  each  fom.  With  this  alternate-forss 
nethod  both  the  equivalency  of  the  test  forms  and  the  stability  of  the 
group  may  be  checked  if  the  second  fora  is  administered  sose  time  later 
to  the  sase  group.  The  altemate-foras  aetbod  is  aost  useful  to  the 
person  acteiinistering  the  test.   It  is  important  to  have  «>re  than  one 
fora  of  a  test  with  high  reliability  because,  in  many  instances,  pupils 
have  to  sake  up  the  test  for  having  missed  it  due  to  illness  or  other 
circumstances. 

In  the  split-half  method  of  determining  reliability  there  is  oas 
group,  one  test  form,  and  one  test  administration.  With  this  method 
the  Spearman-Brown  formula  is  commonly  used,  and  the  pupils*  scores  on 
the  odd-numbered  test  items  are  correlated  against  their  scores  on  the 
even-numbered  items,  th*   split-half  method  gives  a  coefficient  of 
internal  consistency  in  that  it  compares  the  rank  and  score  on  one  half 
of  the  test  against  their  rank  and  score  on  the  other  half.  Reliabil- 
ity is  affected  adversely  by  the  shortening  to  one-half. 

Spearman-Brown  formula: 

r    at 


auc   1  ■<-  r 

oe 

i^ere:  r   ^  coefficient  of  internal  cotwisteaoy  of  the  total 

test. 

r   s  coefficient  of  correlation  between  pupils*  odd-half 
scores  and  their  even-half  scores. 


Qresn,  op.  clt.,  p.  93* 
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IQMn  an  Item  snalysie  has  been  run  on  a  test  and  the  difficulty  value 
or  the  proportion  of  pupils  responding  correctly  to  each  itea  has  been 
obtained,  the  Kuder-Rlchardson  forarula  may  be  used  for  calculating 
internal  consistency.  This  villi  simplify  the  number  of  calculations 
necessary  overall* 

Kndor-Richardson  fontula: 


'„  ■  [j^h  -  ^ 


t^eret  IT  «  nuaber  of  iteas* 

p  «  proportion  (per  cent  of  pupils  answering  item 
correctly)* 

q  e  X  ..  p. 

s.  »   standard  deviation  of  total  test  squared. 

For  a  test  to  have  a   high  degree  of  usability,  it  should  be  easy 

to  adainister  and  score,  econosaical  in  tiae  and  laaterial,  and  have  a 

clear  foroat*  The  usability  of  a  test  is  determined  by  inspection,  and 

is  done  be  the  teacher  before,  during,  and  after  the  adainistration  of 

each  examination* 

THE  APPUCATIOH  OF  TEST  CONSTRnCTION  TO  HIGH  SCHOOL  KODEFBI  ALBEBRA 

Nsdara  algebra  in  the  high  school  has  included  several  topics 
which  are  unfamiliar  to  many  students*  Set  theory,  proofs,  and  funda«> 
■aatal  axioms  are  aaed  as  tools  in  developing  the  familiar  concepts  of 
algebra,  and  the  teacher's  testing  program  must  be  designed  with  this 
in  mind* 


■4bid..  p.  96* 


Planning  the  T«gt 

The  developing  of  a  test  begins  the  firot  time  a  lesson  is 
planned  for  the  particular  unit.  In  following  the  rulea  for  teat  con- 
•truction,  the  teacher  must  have  a  nunber  of  specific  and  general  edu- 
eational  objectives  toward  which  the  instruction  is  aiaed.  For  the 
t4Mt«  to  neasiare  these  objectives,  test  questions  should  be  written 
each  time  a  lesson  is  planned,  following  the  test-specification  chart 
to  detemine  the  approxlBate  number  of  questions  needed* 

Objectives.  TiM   teacher's  instractioaal  objectives  should  be 
•tated  in  specific,  operational  teras,  to  font  the  basis  for  planning 
e  teat.  A  different  sot  of  objectives  would  probably  be  developed  by 
every  aatheaatice  teacher,  with  no  list  being  entirely  accepted  by  all* 
In  order  to  have  a  frame  of  reference.  Max  A*  Sobel  and  Donovan  A. 
Johnson  bave  presented  the  following  sample  list  of  objectives  for  as* 
la  any  mathematics  course. 

The  student  should: 

<>ohave  a  knowledge  and  understanding  of  aathsaatlcal  proceasas, 
facts,  cmd  concepts; 

—have  a  skill  in  computing  with  understanding,  accuracy,  and 
efficiency; 

—have  the  ability  to  use  a  general  problaa-solving  technique! 

—understand  the  logical  structure  of  math«Mtlcs  and  the  nature 
of  proof; 

—use  mathematical  concepts  and  proeeaaae  to  discover  new  gener- 
alisations and  applications; 

— recognise  and  appreciate  the  role  of  aathemfltios  In  society; 

—develop  study  habits  essential  for  inde^pendent  progress  in 
mathematics; 

—develop  reading  skill  and  vocabulary  essential  for  progrssa  in 
mathematics; 
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■Midttso&strate  aueh  mental  traits  as  crttativlty,  iisaglnatioat 

eurloait/t  and  visualization t 
—dovelop  attitttdas  that  lead  to  appreciation,  oonfideace, 

respect,  initiative,  and   independence.^ 

This  list,  of  coarse,  ie  not  exhauetive.   It  may  be  poasible  to  set  up 
objectives  which  do  not  fall  into  any  of  the  preceding  categories,  but 
this  list  does  indicate  the  possibilities.  As  an  exanple,  the  objec- 
tives for  the  first  chapter  in  a  co-irse  in  atodern  algebra  might  be  the 
following:  '^ 

^s  student  should: 
—have  a  knowledge  and  understanding  of  oatheoatical  processes, 

facts,  and  concepts { 
—recognise  th9  practical  application  of  specific  asthenatiesl 

processes,  facts,  and  concepts; 
—have  skill  in  computing  with  understanding,  accuracy,  and 
•f ficienoy. 
BmfiMg   thMie  thrse  general  objectives  now  stated  in  writing,  they  oast 
be  used  in  oon junction  with  the  content  arens,  to  determine  the  spe- 
cific, operational  objectives.  For  the  purpose  of  this  example,  the 
content  areas  of  the  first  chapter  in  a  course  in  nodom  algebra  will 
W  the  following: 

OTfIT  COHTENT  AREA 

1  Operations,  relations,  nnd  symbols 

2  Axioms  and  properties  of  numbers 
5        Order  of  operations 


nax  A.  Sobel  and  Ooaovan  A.  Johnson,  "Analysis  of  Illustrative 
Test  Items,"  Evaluation  in  Mathematics,  Twenty-sixth  Yearbook  of  the 
Rational  Ck}uneil  of  benchers  of  Mathematics  (VSashington,  D.  C. :  national 
Council  of  Teachers  of  Mathematics,  19^1),  p.  72. 
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4  -  Znv«rs«i 

5  Algebraic  Bentancee  and  equatlona 

6  Inequalities 

7  Formulae 

S         Definitions 
$  Sets 

Mith  these  sreea  in  wind  the  three  general  objectives  could  be  rewrit- 
ten in  more  precise  tenas  as  follows: 
The  student  should) 

—have  a  worJcing  knowledge  and  aji  understanding  of  the  defini- 
tions; 
—be  able  to  recognise  the  apr>lication  of  the  given  facts  and 

concepts; 
—be  able  to  use  aMtbeastioal  eyabols  in  the  appUoation  to  prob- 
lems; 
—be  able  to  apply  natheaiatical  processes  to  discover  solutions 

to  probleas; 
—have  skill  in  cooputiag  with  accuraey  and  effieienoy* 
In  this  revised  liet,  the  first  objective  is  a  restnteaent  of  the  first 
general  objective.  The  second  and  third  objectives  are  taken  from  the 
second  general  objective,  and  the  remaining  two  objectives  of  this  list 
are  froa  the  third  general  objective. 

The  teat-specification  chart.  With  the  objectives  now  stated  in 
precise,  operational  terms*  the  content  areas  can  be  combined  with  the 
objectives  to  establish  the  test-specification  chart.  Using  the  con- 
tent areas  in  the  left-hand  column  and  the  objectives  across  the  top« 


a? 

the  two-way  table  i»  foriMd.   Th*  oimrt  i«  then  compl«t«d  by  choosing 
▼alu«6  for  each  pair  of  objectivas  and  content  nr^ae*     Table  III  ia  th« 
tast-apecifleatina  chart  thus  fornad« 

A  teacher  planning  for  this  chepter,  would  be  able  to  see  iBBBe- 
diately  that  unitn  two,  three,  foar«  and  five  are  going  to  require  the 
ffloat  eaphaeia,  since  the  total  percentagMs  are  greatest  in  theae  units* 
He  can  also  notice  in  the  same  rsanner,  that  "be  able  to  apply  natheoMt- 
ical  processes  to  discover  solutions  to  problMw**  is  the  tsost  important 
objective*  Unit  eight,  definitions,  has  only  on*  objective  and  eaa 
thus  be  easily  made  into  a  short  neanureiaont  test,  or  quiz,  avoiding 
the  necessity  to  have  an  addition«l  \uiit  on  the  chapter  test.  Unit 
•even  and  unit  one  eonprising  only  k   per  cent  «md  7  p«r  cent,  respec* 
tively,  of  the  total  night  also  be  batter  tested  with  short  aeasureaent 
tests*  In  the  same  nanner,  the  objective,  '^ave  a  working  knowledge 
and  an  understanding  of  the  definitions'*  has  only  10  per  cent  of  the 
total  and,  in  order  to  limit  the  siee  required  for  the  chapter  test* 
this  objective  might  be  tested  with  one  or  more  short  measurement  tests* 
Choices  for  each  of  these  possibilities  rest  solely  upon  the  teacher's 
judgement  in  the  particular  situation* 

la  previewing  the  chapter,  the  teacher  might  expect  the  students 
to  have  extra  difficulty,  for  instenee,  in  units  two,  four,  six,  and 
nine*  MLth  this  in  mind,  he  could  then  plan  to  give  instructional  tests 
during  the  study  of  any  or  all  of  these  units*  Since  the  grades  would 
not  count  hecvily,  this  test  would  not  preclude  the  necessity  for  a 
measureaiettt  test  over  the  same  material. 

At  the  end  of  units  two  end  five  a  mastery  test  is  a  possibility 
since  the  pupils  %d.ll  have  to  know  certain  parts  of  e^ch  of  these  to 
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proceed  to  the  units  wMch  follow.  This  test  would  most  likely  be  a 
short  test,  covering  very  little  material,  and  one  in  which  scores  are 
high*   Since  the  first  four  units  constitute  about  half  the  chapter, 
according  to  the  relative  weights,  the  teacher  might  want  to  have  two 
large  ffleasureaent  tests,  depending  upon  the  importance  of  this  chapter 
in  relation  to  the  entire  course.  Here,  again,  a  teat-specificatioa 
chart  for  the  entire  course  would  prove  instrumental  in  determining  the 
relative  importance  of  each  chapter. 

At  this  point,  it  %K>uld  be  good  to  have  a  list  giving  a  clear 
and  brief  outline  to  the  teacher  of  the  tests  he  is  considering,  with 
the  relative  weights  of  each  unit  listed  as  a  guide  to  determining  the 
value  of  the  tests.  The  following  list  would  be  representative  of  the 
example  being  used. 

PNIT  CONTENT  AHEA 

1  Operations,  relations,  and  symbols 

2  Axioms  and  properties  of  numbers 

5    Order  of  operations 

h  Inverses 

9    Algebraic  sentences  and  equations 

(     Inequalities 

7     Formulas 

S     Definitions 

$  Sets 

Since  too  many  tests  are  indicated  here  to  be  able  to  include  all  of 
them,  it  is  necessary  for  the  teacher  to  judge  which  tests  to  include 
largely  by  the  specific  material  and  the  particular  classes  involved. 


TEST 

WEIGHT 

Measurement 

7 

Tnfltruetional 

20 

and/or  mastery 

Nona 

x» 

Instructional 

15 

Kastery 

X7 

Instructional 

f 

Measurement 

* 

Measurement 

5 

Instructional 

20 

50 

Constructing  th«  T»»t 

Wh«n  th»  Icseone  have  b««n  planned  by  using  the  objectiTes  and 
content  areas  to  detemine  the  relative  values  of  each,  the  task  of  the 
test  construction  is  siaplified  considerably,  I teas  for  the  test  are 
iirltten  as  each  lesson  is  planned,  using  the  test-specification  chart 
••  a  guide  to  the  nuaber  and  type  of  questions  needed.  At  testing  time, 
then,  all  that  rsMins  is  to  eonpile  the  questions  into  the  necessary 
test  and  reexaaine  each  itmi  for  iten-writing  errors. 

Pie  item-writing  phase.  Three  types  of  items  are  written  for 
tests,  corresponding  to  the  three  types  of  examinations.  These  are  of 
the  oral,  objective,  and  essay  types,  as  previously  mentioned.  The 
performance  test  which  was  also  mentioned,  is  primarily  used  in  the 
fields  of  Home  Economies,  Industrial  Arts,  and  Business,  where  a  eoor* 
dinated  performance  connected  with  a  product  is  involved.  It  is  not 
Intended  to  replace  a  panel 1-and-paper  test,  but  is  intended  rather  to 
test  those  areas  which  a  pencil-and-paper  test  will  not  adequately 
evaluate. 

In  analyzing  the  test-specification  chart,  it  can  be  seen  that 
neither  the  content  areas  nor  the  objectives  require  the  particular 
characteristics  of  an  oral  examination.  The  oral  questioning  technique, 
on  the  other  hand,  would  be  an  excellent  instructional  device  where  ths 
instructional  test  has  been  indicated  on  the  list  of  tests.  With  the 
use  of  the  oral  questioning  technique,  grades  are  not  counted  heavily, 
since  the  purpose  is  that  of  instruction  and  not  measurement.  Referring 
•gain  to  Table  III,  the  objectives,  "have  a  working  knowledge  and  an 
understanding  of  the  definitions,'*  and  '^be  able  to  recognize  the 
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application  of  tha  given  facta  and  conoapta,"  %rould  appaar  to  fit  most 
readily  into  the  oral  questioning  technique.  Having  theae  two  refer- 
eneea  aa  guidea,  oral  questioning  ia  indicated  for  unite  two,  four« 
aix,  end  nine,  with  only  one  objective  being  teated  in  unite  two,  aix, 
and  nine,  and  both  objectivea  being  teated  in  unit  four. 

Unit  t%#o,  axioHs  and  proper tiea  of  nunbers,  would  lend  itaelf 
very  well  to  oral  queationing.   For  inatance,  an  axion  which  aays,  "if 
the  aaae  quantity  ia  added  to  equal  quantitiea  of  the  aane  kind,  the 
mxm0   are  equal,"  can  be  teated  orally  tcLth  probleaa  which  are  aimed  at 
having  the  student  recognize  the  axiom* a  applicationa.  The  axiom  can 
be  illuatrated  %d.th  thia  problem t 

nAMPLE  1. 

If  two  girla  each  have  five  dollara  and  they  receive  two  dollars 
apiece  for  babyaitting,  they  each  now  have  aeven  dollara.  This 
is  an  example  of  which  axiom? 

Another  item  for  oral  quentioning  would  be  the  following: 

EXAMPLE  2. 

If  the  side  of  one  aquare  ia  equal  to  the  side  of  another  aquare, 
do  the  squarea  have  equal  areas?  If  two  aquarea  have  equal 
•rsss,  is  the  side  of  one  square  equal  to  the  side  of  the  othert 
If  one  number  equals  another,  are  their  squarea  equal?  Are  their 
cubes  equal?  Uhat  is  the  axiom  illustrated  by  these  problems? 

Questions  such  as  this  on  axioms  and  properties  of  numbers  will  permit 
the  teacher  to  draw  out  the  full  meaning  of  the  axiom  «^en  the  student's 
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aa«w«r  is  V8gu«  or  Incoapletd.  Oral  queetioning  in  this  area  will 
asalst  the  teacher  in  dlagnoaing  mlsunderstandinga  which  the  pupils 
have  about  particular  rules.   If  the  axiom,  '^the  aaiae  powers  of  equal 
quantitiee  are  equal,**  were  bolng  aought  by  the  teacher,  and  the  answer 
a  pupil  gave  was,  "the  rjowers  of  equal  quantities  are  equal,"  a  basic 
■isunderatanding  is  apparent.  The  teacher,  at  this  point,  could  give 
a  counter«»exaaple  and  proT^ably  elicit  the  correct  answer.  Sone  obvious 

problecss  which  could  be  used  to  point  out  the  error  in  the  student's 

2  5  2  5 

Statement  would  be,  "Is  a  equal  to  a  ?<*  and  "Is  2  equal  to  2  7" 

?or  this  unit,  on  axioms  and  properties  of  nuabero,  the  "dis* 
covery  method"  could  bo  quite  profitably  used.  In  Example  2,  if  the 
final  question  were  changed  to  "What  rule  oould  you  sake  from  these 
illustrations?",  the  pupils  would  be  trying  to  discover  an  axiom  lAieh 
they  would  encounter  in  the  next  lesson  in  the  text* 

The  use  of  lengthy  oral  examinations  in  mathematics  would  necss* 
sarily  be  limited,  but  as  a  questioning  techniqae,  some  measurement  is 
possible.  The  greatest  value  is  as  an  instructional  device,  i^sre 
grades  are  not  counted  heavily. 

Referring  to  Table  III,  the  test-specification  chart,  any  of 
the  content  areas  and  any  of  the  objectives  eould  b«  tssted  using  the 
objective  type  test.  Each  of  the  four  common  forms  of  the  objective 
tsst,  the  short-answer,  the  alternate-response,  the  multiple-choice, 
and  the  matching  form  is  applicable  at  some  point  la  the  chart  and  can 
be  used  at  the  discretion  of  the  teacher. 

The  short-answer  fora  nay  include  questions,  definitions,  incom- 
plete sentences,  or  identification  items.  An  example  of  the  definitloa 
item  would  be  the  following  illustration: 
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XXAMPLE  3* 

A*  null  e«t 

fi.  solution  0«t  o£   an  algebraic  aentenca 

This  typa  of  itaa  borders  vrj   closely  to  the  essay  test  in  the  diffi- 
culty of  scoring  objectively.  A   written  answer  of  this  nature  would 
hav«  to  have  a  strict  grading  •ystea,  in  order  to  have  objective  soor- 
lag;  there  would  have  to  be  one  point  for  each  thought  the  teacher 
expected  the  student  to  express.  For  instance,  in  Sxaaple  3,  for  the 
null  set,  there  might  be  two  points  sought: 
Point         Definition 

1  A  set 

1  having  no  mead} ere. 

and  for  the  solution  set: 

Point  Definition 

1  The  sst 

1  of  all  numbers 

1  which  make  an  algebraic  sentence  true. 

!Rieee  definition  items  could  easily  be  considered  as  question  items, 
by  rewriting  them  in  thim  ammi^rs  ^ 

EXAMPLE  k, 

Miat  is  the  definition  of  e   "null  set"? 

EXAMPLE  5. 

Mbat  is  the  "solution  set"  of  any  algebraic  sentence? 


34 
Another  example  of  the  question  fopw  is  the  type  of  question  which  ••!» 
for  only  one  or  two  specific  worde.  Illuetretione  of  this  would  be: 

EXAMPLE  6. 

Mhat  property  eaye  that  the  sub  of  any  two  ntuBbera  is  a  number? 

EXAMPLE  7. 

What  ia  the  inverse  of  the  operation  "squaring"? 

EXAMPLE  8. 

What  is  the  name  of  the  set  haring  no  menbers? 

The  incomplete  sentence  form  of  the  objective  test  is  quite  sim- 
ilar to  the  question  form,  in  that  there  is  usually  only  one  or  two 
specific  tiords  vAiieh  will  correctly  answer  the  itea.  The  iacomplsts 
sentence  can  be  used  also  in  each  of  the  above  questions  in  the  follow- 
ing manner: 

SXAMPL5  9. 

"The  sum  of  any  two  numbers  is  a  number"  is  an  example  of  ths 

_______________^  property. 

EXAMPLE  10. 

The  inverse  of  squaring  is  called  . 

EXAMPLE  11. 

The  name  of  the  set  having  no  rsembers  is  the  __________  set. 

Or  in  another  example: 

EXAMPLE  12. 

By  the  transitive  axiom,  if  a  ■  b  and  b  =  c,  then  . 
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Aiiotuer  ahort-aii0\4*r  t/p«t  cloeely  r«lat«d  to  th«  incomplete 
seutenoe  forci,  la  the  idantiricatioa  foria.   This  form   is  uost  practical 
in  itotts  whsro  a  figure  can  be  shown,  with  each  of  the  parte  labeled 
which  are  to  be  identified.   There  is  oue  particular  case  where  an  iden* 
tification  item  would  be  useful  in  the  chapter  outlined  on  Table  III: 
the  nuaber  line.   An  identification  item  would  pemit  the  teacher  to 
test  the  student's  knowledge  of  the  aaaes  of  portions  of  the  nunber  line 
without  having  to  write  separate  questions  for  each*  It  could  be 
designed  like  this: 

EXAMPLE  13. 

Identify:  ^^" 


where  the  answers  are: 

a.  integers 

b.  whole  numbers  or  non-negative  integers 
e.  aegative  integers 

d.  natural  numbers  or  positive  integers 

Also  coming  under  the  classification  of  the  short-answer  form 
is  the  familiar  problem-solving,  where  the  student  is  to  derive  a  spe- 
cific numerical  answer  from  a  given  problem.  Again,  as  in  the  essay 
question,  the  problwR  arises  as  to  how  much  partial  credit  to  allow  for 
a  partially  correct  solution.   If  the  problem  is  short  and  relatively 
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0iupl«,  there  laigat  b«  one  point  given,  that  being  for  the  correct 
«insw«r.  iiowev^r,  wnen  the  problea  is  more  complex,  a  point  should  be 
allo^fed  for  specific  steps  of  the  problem  which  the  student  must  obtain 
to  have  the  corx'ect  unswer.   Consider  the  problem  3x  s  1^.   In  this 
problem,  there  would  be  little  value  in  allowiu^  more  tbaa  one  point. 
In  the  following  problem,  however,  there  would  be  points  given  for  cer- 
tain steps. 

EXAMPLE  1^. 

Let  Z  s  jintegersl. 


If  xe  Z,  find  |x|2(x  ♦  *♦)  »  5x  +  2  -  xj. 


1 

2(x  *  h)  m  Zx  *  6 

1 

5x  -  X  -  2x  =  2x 

1 

8-2.6 

1 

1-3 

Points  for  this  problem  could  be  distributed  as  follows: 
Points  Steps  Solution 

2(x  ♦  4)  «  5x  ♦  2  -  X 
2x  +  8b5x-*-2-x 
8  «  2x  ••-  2 
6  «  2x 

3  *  X,  or  X  a  3 

Thus,  this  problem  may  bsooms  a  four-point  problsa.  The  student  muat 
•xpand  the  term  with  parentheses,  he  must  collect  both  the  x-teras  and 
the  constants,  and  ha  must  divide  by  two  to  obtain  the  result.  This 
type  of  problem-solving  would  probably  be  the  best  examination  form  to 
use  for  objectives  having  to  do  with  skill  in  eoaputing. 

The  altern!)to-reaponae  form,  since  it  is  normally  used  where 
there  ara  only  two  possible  answars,  will  find  ver;f   limited  use  in  nath- 
enatics.   From  Table  III,  a  unit  which  mi^jht  be  tested  by  oeans  of 
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tru«-falB«  questions  is  unit  two,  axioMS  and  propsrtiss  of  numbers. 
In  this  case,  a  modified  true-false  item  where  the  student  must  correct 
the  underlined  word  if  the  Btatement  is  false,  would  be  best  since 
there  are  numerous  axioms  and  properties,  and  not  just  two  choices. 
Examples  of  the  nodified  true-false  questions  would  be: 

EXAMPLE  15. 

The  statement  2  =  2  is  an  example  of  the  symmetry  axiom. 

EXAMPLE  16. 

If  2  "  ^%   and  4  =  <?,  then  2  »  ^.   This  statement  is  en  exaapls 

of  the  replacement  axiom. 

Example  13  would  be  false  until  "symmetry**  is  cbanf^ed  to  '^reflexive.'* 
Erample  16  would  be  corrected  to  the  "transitive"  axiom.   It  is  impor- 
tant to  notice  that  only  the  important  words  have  been  underlined, 
since  these  are  the  words  the  student  must  learn  and  rem«rt>er.  Modi- 
fied alternate-response  items,  such  as  these,  require  the  student  to 
be  able  to  recognize  the  falsity  of  the  statement  and  to  supply  ths 
word  which  will  correct  it.   True-false  items  are  not  as  discriminating, 
item  for  item,  as  the  other  objective  test  forms  and  are  somewhat  more 
subject  to  ambiguity  and  misinterpretation. 

The  multiple-choice  item  is  more  difficult  to  prepare  than  the 
alternate-response  form  but  is  good  for  measuring  a  variety  of  educa- 
tional objectives.   The  multiple-choice  items  have  the  most  instruc- 
tional value  if  the  choices  are  the  answers  a  student  would  obtain  by 
making  certain  common  errors.   These  choices  can  be  illustrated  in  this 
manner : 
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EXAMPLE  17. 

7h«  «r«a  (in  square  Inches)  of  the  region  bounded  by  a  circle 

with  a  radius  of  3  inches  is: 

A.  6  f/ 

B.  9fr 

C.  18  /T' 

D.  56  ^ 

ir  A  is  the  circumference  of  this  circle,  Ztfr^   which  could  not  be 
in  square  inches.   ^Riis  fact  is  not  likely  to  be  e  specific  determiner 
that  would  elioinate  this  choice,  since  students  frequently  confuse  the 

circuaference  foimula  and  the  area  formula  and  tend  to  ignore  the  units 

2 
of  measure.   Choice  B  is  Ziir   ,  which  is  a  cross  between  the  area  foraula 

and  the  circusfeisnc-  fonsula.   Choice  C  is  correct,  and  D  is  r/d  trfiere 

2  2 

d  has  been  substituted  for  r  in  the  correct  formula.  An  exaaple  in  a 

unit  on  sets  would  be  the  following: 

EXAMPLE  18. 

If  M  »  -j^l,  2,  3,  ^j-  and  N  »  /j,  a|,  what  does  M  UH  equal? 

A.  {l,  2,  3.  ^ 

B.  {3,  'f} 

C.  {1,  2,  3.  ^.  3,  ^} 

D.  {1,  2,  3,  ^]  ♦  {5.  ^} 

E.  {17} 

Choice  A  is  correct;  B  is  M  nNj  C  has  a  duplication  of  like  terms,  a 
aistake  often  made  by  students  when  first  studying  modern  algebra;  D  is 
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M  ♦  Hi  and  K  le  found  by  edding  l*Z*3+^*3*^*     Another  exampl* 
would  be  the  following,  where  the  wrong  choice  indicated  the  student's 
■Isunderstanding  of  a  definition* 

EXAMPLE  19. 

If  one  quantity  equals  a  second  quantity*  then  the  second  equals 

the  first.  This  statesent  is  the  ..„_..,,.._^  axios, 

A.  power 

B.  reflexive 

C.  synunetry 

D.  transitive 

B»e  Batching  fora  of  objective  test  is  most  useful  for  seesuring 
recognition  and  recall.   This  can  best  be  illustrated  by  using  part  of 
a  list  of  naaes  and  syabols  found  in  the  unit  on  sate  from  Algebra  One 
by  Qlen  D.  Vannstta.   It  should  be  noted  thet  the  names  and  syabols  of 
Bxsaple  20  are  for  use  in  the  specific  text  cited  and  are  not  universal 
In  every  instance. 

Several  distracters  are  included  in  this  list  of  syabols,  such 
ma   ''not  con.tmned  in,"  "contains,"  and  "universal  sst."  Thus  a  studsnt 
•ay  choose  a  as  the  answer  to  2  and  H  as  the  answer  to  3«  or  h«  mmj 
choooe  I  ae  the  answer  to  2  and  J   as  the  answer  to  3t  since  either  of 
these  pairs  is  consistent  in  form. 


Glen  D.  Vannatta,  A.  Wilson  Goodwin,  and  Harold  P.  Fawcett^ 
Algebra  One;  A  Modern  Course  (Columbus,  Ohio:  Charles  E.  Merrill  Books, 
Inc.,  1962),  p.  50. 
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SXAMPLE  20. 

Matching: 

Haat* 

Symbol 

1.     8«t 

A. 

S 

2.     Meoibar 

B. 

u 

3.     Not  a  member 

C. 

U 

k.     Is  contained  in 

D. 

n 

5*     Set  notation 

E. 

(  } 

6.     Solution  set 

F. 

(  } 

7.     Union 

G. 

€ 

8.      Intersection 

H. 

^ 

I. 

C 

J. 

^ 

K. 

3 

Generally  speaking,  the  objective  teat  will  be  the  aost  commonly 
used  test  in  any  mathematics,  Uue  to  the  adaptability  to  the  subject 
and  the  ease  in  scoring.  However,  the  essay  form  is  becoming  of  more 
importance  in  modern  algebra  due  to  the  increasing  emphasis  on  proofs. 
A   proof  is  necessarily  classified  as  an  essay  item  since  the  student 
must  supply  the  steps  from  his  otm  background  of  knowledge  and  organise 
them  until  they  are  logical. 
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SZAMPLE  21. 

Prove  that  (a  +  b)  ♦  [(-a)  +  (-b)]  =0. 
Proof: 

(a  ♦  b)  +  [(-a)  +  (-b)]  »  0  Oivan 

[(a  •*-  b)  ••■  (««}]  •«-  (-b)  3  0  Aasociativ*  property 

[<-•)  ♦  (a  ♦  b)J  ♦  (-b)  B  0  Ck>m]iut8tiTe  property 

lljt"*)  ♦  al  ♦  b|  >♦•  (-b)  o  0  Assoclstlve  property 

(0*b)+(-b)=o  Inverse  property 

(0)  +  [b  ♦  (-b)J  «  0  Aseociative  property 

[b  ♦  (-b)j  m   0  Identity  property 

0  «  0  Inverse  property 

<he  coapilinK  phase*   If  the  teat  iteos  have  been  written  whan 
laaaoBS  were  plauned,  the  task  remaining  is  to  compile  these  Iteas  Into 
a  test.  Since  testing  is  done  for  various  purposes,  the  compiling  must 
be  Aimed  at  satisfying  one  of  the   three  test  functions:  instruction, 
■astery,  or  flieasuresent. 

In  compiling  the  instructional  test  for  unit  nine  outlined  in 
!teble  III,  ten  multiple-choice  items  might  be  best.  These  items  should 
be  specifically  designed  to  include  the  wrong  answers  which  pupils  will 
obtain  if  they  make  any  of  the  most  coraeon  errors,  in  addition  to  the 
one  correct  answer.  This  one  example  is  not  Intended  to  show  the  only 
possible  type  of  test  to  be  used  here,  but  rather  to  illustrate  the 
fact  that  the  items  must  be  chosen  to  satisfy  a  particular  purpose. 

A  mastery  test  for  unit  five,  since  the  students  are  expected  to 
answer  90  per  cent  correctly,  would  probably  include  only  problem-solv- 
ing items.  Students  would  be  given  equations  and  aakad  to  solve  for 
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the  value  of  the  variable,  or  given  t#rltten  probleae  and  aeked  to  apply 
equations  to  the  aolving  of  the  problema.   In  either  caae,  the  intended 
purpose  of  the  test  would  be  to  aecure  the  teacher  as  well  ae   the  atu- 
dente  that  the  ekllls  in  the  unit  had  been  mastered. 

The  meaaurenent  teat  ia  the  test  which  la  the  primary  grade- 
deterainert  aince  it  purporte  to  Beaeure  a  atudent'a  learning  for  th« 
grading  period.  The  abort  tneaaurenent  teat  will  not  be  very  reliable 
unleas  the  scorea  ere  accumulated  throughout  the  period  to  give  one 
total  Bcore  approxirap tely  equivalent  to  a  one-hour  teat.  Teat  itMHl 
on  the  meaaurement  teats  encosipaae  all  of  the  types  discuaaed  in  the 
iteR-writing  phaae. 

Bvaluating  the  Test 

Once  the  teat  has  been  E-diainiatered,  the  results  should  be  used 
to  evaluate  its  effectivenecr.   All  «eoBuring  inatrumenta  posceaa  to 
some  degree  three  lai:ortant  quail  ties:  validity,  reliability,  and  usa- 
bility.  Ilieae  qualitiea  apply  whether  the  test  is  in  mathematics  or 
any  other  subject,  and  the  procedures  and  formulas  are  also  the  same. 

If  a  teacher's  testing  program  is  to  laprove  from  year  to  year, 
the  itema  which  have  been  used  on  each  teat  nust  be  analyzed  and  either 
revised  or  discarded.  Items  which  should  be  discarded  ere  those  vrhich 
do  not  dlsci^ainate  between  good  and  poor  pupils  or  thone  vAieh  show 
negative  discrimination,  whare  the  poorest  pupils  are  more  succesaful 
than  the  best  pupils* 
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SUMMAST 

Th«  t«ach«r*s  Inetructional  purpoeee  will  gen«rally  fall  into 
thraa  categoriaa  of  test  runctions.   A  last  may   function  as  an  inetruc- 
tional taot,  vdiich  chackB  skill;  a»  a  ■aetery  tast,  which  •▼alustas  a 
pupll*8  knowledga  of  fandamentala;  or  as  a  ■•amtraaaat  taat  which  saaa* 
urae  bis  understanding  of  concepte.   'Riese  test  functions  are  aacb 
•atisfied  by  using  any  of  four  types  of  teats:  the  oral,  objective, 
essay,  or  perfornance  test. 

There  are  three  stages  neceasary  to  develop  a  high-qoality  tentt 
the  planning  stage,  the  construction  stage,  end  the  evaluation  stage* 
In  planning  a  test,  the  instructional  objectives  and  the  content  areas 
together  make  up  the  test-specification  chart,  which  will  detsrnd.ne  the 
ohcioe  of  test  f^rn  and  the  t3rpe  and  number  of  iteas*  The  oonstructioa 
of  tt  teat  is  usually  handled  in  two  phases:  the  i ten-writing  phase  and 
the  eoopilic^  phsse*  The  evaluation  stage  involves  an  analysis  of  the 
test  results  to  serve  as  a  basis  for  improving  the  ite«s  on  the  teat. 
This  is  necessary  in  order  that  they  may  be  revised  on  testa  which  are 
given  in  subsequent  years.  The  tools  which  are  used  for  the  analysis 
are  validity,  which  z>efers  to  the  testes  truthfulness;  reliability, 
which  refers  to  its  consistency;  and  usubility,  which  pertains  to  its 
practicality. 

The  general  principles  of  test  construction  apply  to  ••••ntially 
all  subjects  being  taught  in  high  school.  The  epocific  application  of 
these  principles  to  high  school  sodern  algebra  is  not  a  natter  of  devel- 
oping new  principles,  but  rather  one  of  using  the  new  approaches  of  the 
nodern  algebra  in  writing  individual  items. 
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Modem  alg«bra  in  th«  high  school  has  inclndsd  8«t  theory, 
proofs,  and  axioms  fros  «Aiich  th«  faalliar  concepts  of  algebra  ar* 
devalopad.  !%•  discovery  method  is  used,  where  a  student  will  eolve 
pirobleflw  and  front  these  state  a  rule,  and  this  saee  method  can  be  used 
on  instructional  testa  to  give  extra  aid  to  the  student* 

A  carefully  planned  testing  program  will  not  only  detensine  a 
student's  grade,  but  will  al^o  aid  the  teacher  in  hia  test-fflaking  task 
in  future  years*  With  careful  analysis  and  revision  of  itea»  a  teach- 
er** claserooB  tests  will  contribute  additional  instruction,  and  will 
continue  to  Increase  in  effectiveness  froa  year  to  year* 
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It  1M0  th«  purpoflia  of  this  atud.y  to  (1)  present  the  general 
principles  of  teat  construction  as  they  are  comreonly  found  in  texto  on 
aeasurem«nt  and  evaluatf-on,  and  (?.)   offer  method*  of  applying  theM 
principles  to  hii^h  school  modem  algebra. 

•Rie  method  of  carrying  out  this  atudy  of  test  construction  for 
modern  algebra  was  reading  and  library  research.  Only  literature  p«r» 
talnlng  to  classroom  tests  in  mathematics  and  references  on  general 
•aasurMient  techniques  which  were  available  in  the  Kansas  State  Onlvcr- 
•ity  Library  were  investigated.  The  general  information  was  then 
applied  to  high  school  modern  algebra  by  the  author. 

!rhere  is  no  one  bost  test)  it  la  for  the  test-maker  to  construct 
the  best  test  for  his  Instructional  purposes.  The  teacher's  instruc- 
tional purposes  will  generally  foil  into  three  categories  of  test  func- 
tions. A  test  may  function  as  an  instructional  test«  which  checks 
•kill;  am  *  mastery  test,  which  svaluates  a  pupil**  knowledge  of  funda- 
mentals; or  as  a  measurement  test  which  msasuree  his  understonding  of 
concepts* 

%ese  test  functions  are  each  satisfied  by  using  any  of  four 
types  of  tests:  the  oral,  objective,  essay,  or  performance  test.   Each 
of  these  types  of  teste  must  be  considered  for  its  strengths  and  weak- 
tt««s«s  as  well  as  for  the  function  it  is  to  serve.   The  oral  test  is 
valuable  in  that  it  improves  test  rapport  for  pupils  who  fear  written 
examinations,  but  its  weakness  is  that  it  is  too  time-consuming.   The 
objective  test  can  be  adapted  to  several  teaching  objectives,  and  etlll 
be  mode  highly  vnlld  and  reliable.   Its  disadvantage  is  that  it  is  dif- 
ficult to  prepare.   The  essay  test,  although  it  favors  the  verbslly 
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inclined  student  and  is  difficult  to  grade,  l*  ««qr  ^  construct  and  it 
doee  pzx>Bote  the  proi>«r  tyi>e   of  study.  V/here  eoae  skills  and  bbetract 
abillticB  not  meeeured  by  other  convontlon&l  teet  forme  netd  to  be 
messurbd,  the  perforoance  teet  is  in  order.   This  teet  is  not  adapt«bl« 
to  oany  fields  of  leernicg  and  it  is  often  difficult  to  grede. 

There  sre  three  et&ges  neceerw^ry  to  develop  e  high-quality  tests 
the  planning  stnge,  the  construction  etoge,  &nd  the  evaluetion  etege* 
In  pl&nning  a  test,  the  Inetructionel  objectives  ano  the  content  areas 
together  neke  up  the  tept-epeclflcAtlon  chert,  which  trill  detemine  th« 
choice  of  test  form  and  the-  type  end  number  of  iteaie.  The  construction 
of  a  test  is  usually  handled  in  two  ph&ses:  the  item-writing  phace  and 
the  cocpiling  pheee.  Item-writing  is  not  such  a  tedious  tdck  if  the 
teacher  will  write  severtil  teet  items  every  time  he  plans  a  lesson. 
This  servec  a  dual  purpoec  of  directing  the  testing  towt>rd  the  instruc- 
tional objectives,  and  of  easing  the  difficulty  in  developing  a  test  at 
the  end  of  s  unit.  The  test  can  then  be  coapiled  from  these  previously 
written  itens  and  exeeined  for  comu&on  itea-writiag  errors.   Ths  evalu- 
ation stags  involves  an  enalysis  of  the  test  results  to  servs  as  a  basis 
for  isproving  the  items  on  the  test.  This  is  nscesaary  in  ordsr  that 
they  may  be  revised  on  tests  which  are  given  in  subsequent  ysars.  Ths 
tools  vhich  are  used  for  the  analysis  are  validity,  which  refers  to  ths 
test*8  truthfulness!  reliability,  which  refers  to  its  consistency;  a&4 
usability,  which  pertains  to  its  practicality. 

The  application  of  these  princlplea  to  modern  algebra  involves 
developing  the  instructional  objectives  in  specific,  operational  tei 
These  objectives,  together  with  the  content  areas  of  a  unit  form  ths 
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basis  for  the  tost-speclfication  chart,  which  vdll  detemine  the  typ« 
of  test. 

It  should  be  noted  that  in  modem  alf^ebra  special  attention  ia 
given  to  sets  and  to  the  proofs,  axloas,  and  postulates  «dbieh  fomerly 
have  been  limited  to  geonotry.   The  test  items  in  modern  algebra  ars 
aimed  at  fostezdng  these  concepts  by  testing  ths  applications  whera 
possible.  Modem  al|^bra  has  prssanted  a  naw  approaeh  to  an  old  high 
school  subject  and  it  follows  that  the  classroom  teats  mast  present  a 
corresponding  approach  if  the  testing  prograa  is  to  aid  in  tha  total  ,, 
aducation  of  the  individual  pupils. 

A  carefully  planned  testing  prograa  will  both  determine  tha 
pupila*  grades,  and  assist  tha  teacher  in  iaproving  his  test  iteets  for 
future  years.   With  a  good  analysio  and  revision  of  iteois  a  teaehar*B 
elasarooffl  tests  will  contribute  additional  instruction,  and  will  con- 
tinue to  increase  in  effectiveness  from  year  to  year. 


